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Abstract. 

MUSE, the Multi Unit Spectroscopic Explorer, is a 2nd generation integral-field spectrograph under final 
assembly to see first light at the Very Large Telescope in 2013. By capturing ~ 90000 optical spectra in 
a single exposure, MUSE represents a challenge for data reduction and analysis. We summarise here the 
main features of the Data Reduction System, as well as some of the tools under development by the MUSE 
consortium and the DAHLIA team to handle the large MUSE datacubes (about 4x10* pixels) to recover 
the original astrophysical signal. 
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1 Introduction 

The Multi-Unit Spectroscopic Explorer (MUSE, Bacon et al. 2010") is a second generation instrument to be 



commissionned in 2013 on the Very Large Telescope (VLT, unit telescope UT4). It is an integral-field spectro- 
graph operating in the visible wavelength range with two main modes of operation: the Wide Field Mode (1x1 
arcmin field-of view, 0.2"/pixel) and the Narrow Field Mode (7.5x7.5 arcsec, 0.025"/pixel), both with a spectral 
resolution of 1800-3600. 

The instrument is specifically designed to exploit the capabilities of the VLT Adaptive Optics Telescope 
Facility (AO), which can use four laser guide stars and a natural guide star to apply corrections via a de- 
formable secondary mirror. The main scientific goal of MUSE, to be carried out during the Guaranteed Time 
Observations, will be to study the high redshift Universe through the measurement of the Lyman-a signature 
in distant galaxies (2.8 < z < 6.7). Examples of other science topics include the study of nearby galaxies and 
intermediate redshift groups (up io z = 0.8) as well as globular clusters. 

In order to fully cover such a large field-of view, the instrument is composed of 24 integral-field units, each 
observing in parallel a 24*^^ of the field. The light entering each unit is sliced and reimaged into 48 pseudo-slits, 
then propagated through a spectrograph before reaching a 4k x 4k detector (Fig. [T]). 

In total, the data acquired by MUSE in a single exposure amounts for 86400 optical spectra, or 3.6x10* 
pixels. A typical deep field observation with MUSE will have to combine ^80 such exposures: this represents a 
challenge for data reduction and analysis. 



2 MUSE data reduction 



A dedicated Data Reduction System (DRS) is being developed by the MUSE consortium under supervision of 
AIP, and will be delivered to ESO together with the MUSE instrument. Most of the functionalities described 
hereafter are now fully functional or at the stage of final tests / improvements ( Weilbacher et alT||2012 ). 

The main task of the DRS is to convert the raw data coming from the 24 CCDs and create a fully calibrated 
datacube (in spatial, wavelength and flux coordinates), corrected for all sorts of instrumental and atmospheric 
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Fig. 1. (adapted from Weilbacher et al"]|2009 | Splitting of MUSE data: example given for the case of the Wide-Field 
Mode (WFM). The field splitter separates the 1x1 arcmin field-of-view into 24 sub-fields (top-left), each entering a 
separate IFU and sliced into 48 pseudo-slits (top-right). The various spectra produced are imaged onto a 4k x 4k 
detector (bottom). 



effects. The system is designed to be embedded into the ESO data flow system and work as an automated 
pipeline interfaced with the ESO tools esorex and gasgano, as for other second-generation instruments (e.g. 
XShooter). It is written in the C language using the ESO Common Pipeline Library. 

In order to reach the best quality and reliability of the reduced data, only one interpolation step is performed 
at the end of the reduction to produce the final datacube. During the intermediate steps, the information from 
the 24 CCDs is propagated, for each pixel, in a master table called pixel table. Another important goal of the 
DRS is to propagate, throughout the data reduction steps, the error information corresponding to each pixel. 

2.1 Data Reduction Cascade 

The first steps of the DRS perform a classical reduction of each of the 24 CCDs: 

• Creation of master bias, dark and fiat-field frames. 

• Creation of master tables for geometrical, tracing and wavelength calibrations. 

• Then each science frame is corrected for bias, dark, fiat and the pixel table is created with the pixel values 
and the corresponding positions and wavelengths. 

In the second part of the data reduction cascade, the sky subtraction is performed on the entire pixel 
table, then the flux and astrometric calibrations are computed. Finally, the datacube (2 spatial axes and one 
wavelength axis) is constructed by a single interpolation of the pixel table. 

2.2 Sky Subtraction 

Sky subtraction is a very important step in the reduction of datacubes, especially in the red part of the spectrum 
where a large number of atmospheric emission lines dominate the background noise. This is also the region 
where MUSE will observe faint and distant Lyman-a emitters. 
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The current approach used in the DRS for sky subtraction is based on the modelling of the night sky (before 
resampling) into emission lines and continuum in single slices. This assumes a solid knowledge of the Line 
Spread Function (LSF) for each slice, which is modelled following the wavelength calibration using the brightest 
and most isolated lines of the arc exposures. More details on this sky subtraction method are presented in 



Streicher et al. (2011). 



An alternative sky subtraction procedure is currently being tested. The approach used is perpendicular to 
the previous one, in the sense that the sky fit and subtraction is performed in the spatial (rather than the 
spectral) direction. Bright continuum objects, emission lines and cosmics are iteratively masked and removed 
from the fit. The main advantages of this classical sky subtraction is the low computing cost, and the fact that 
it does not depend on the knowledge of the LSF. However, it could be more vulnerable to the presence of bright 
objects in the case of crowded fields. 



3 MUSE data analysis 



3.1 Fusion of MUSE data 

One of the main difficulties of the MUSE data analysis is to combine, in an optimal way, the signal obtained 
in multiple exposures taken under different atmospheric (seeing, transparency, sky variation) and instrumental 
(LSF, field location and orientation) conditions. In the DRS, this task is performed by directly interpolating 
multiple pixel tables into a single combined datacube during the final reduction step. 

Alternatively, a more complex fusion task (named HyperFusion) has been developed in the DAHLIA group 
(Petremand et al. 2011a). A combined datacube is reconstructed by maximizing, in an optimal Bayesian context. 



a posterior probability built from the whole set of observations and their acquisition parameters. 

Both the spatial (PSF) and spectral (LSF) responses are taken into account in this inverse problem. Com- 



pared to the "direct" approach, the fusion clearly improves the resolution of astrophysical sources (Figure 3.1 ), 



but this is a time-consuming process: combining 10 exposures take up to 5 days, compared to 2 hours with the 
DRS. 




Fig. 2. Result of the fusion of MUSE datasets for 2 re- 
solved sources using the direct method (left images) or 
the Bayesian fusion (right images). 
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3.2 Visualisation of MUSE data 



A dedicated tool, called QuickViz ( Petremand et al.|2011b| f^ has been developed to help the MUSE users analyse 
such large datacubes. QuickViz is designed as a plugin for Aladin and is written in the Java language. 
Specific features of QuickViz are the following: 

• Coupled navigation between the spatial and spectral axes thanks both to calibrated cursors. 

• Full use of multi-core architectures to load and handle large datacubes as well as extract spectra from. 

• Data visualisation through multiple views, selections and simple processing algorithms. 

• Visualisation of the associated variance to a datacube in the form of an animation. 



* publically available at 



http : //Isiit-miv . u- strasbg . f r/paseo/cubevisualization . php 
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3.3 Source detection and extraction 

The MUSE datacubes will contain a large number of sources (up to a few thousands in the deepest fields), 
appearing through their spectral continuum over a large wavelength range or through emission lines (only 
detected on a few wavelength slices). Several data analysis tools are currently being developed in order to 
produce a clean catalog of all detected objects in a given datacube: 

1. A classical approach uses an image analysis software (such as SExtractor) to produce a catalog of contin- 
uum sources from the white-light image (obtained by collapsing the full datacube) as well as emission line 
sources (detected on narrow-band images over a few wavelength slices). 

2. A similar approach performs the fitting and subtraction of all continuum sources before searching for 
emission lines by cross-correlation of the full datacube with a simple line model. 

3. Alternatively, more complex data mining techniques are used to "denoise" the full datacube and iden- 
tify sources based on a simple dictionary of spectral shapes (Bourguignon et al. 2010[ Fig. |3|. These 



segmentation techniques can more easily de-blend overlapping sources. 



4. Finally, a 'perpendicular' technique is the use of point marked processes ( Chatelain et al.|2010 | to identify 
sources assuming a simple morphological profile (elliptical shape, Sersic). 
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Fig. 3. Left: Example of "dictionary" of spectral shapes used in the segmentation technique: a given source spectrum 
is fitted by the linear combination of such shapes. Right: Example of datacube segmentation: sources are colored 
according to their spectrum. 
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